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Abstract. Given a set S of bipartitions of some finite set X of 
cardinality at least 2, one can associate to E a canonical X-labeled 
graph B{Tj)^ called the Buneman graph. This graph has several 
interesting mathematical properties — for example, it is a median 
network and therefore an isometric subgraph of a hypercube. It 
is commonly used as a tool in studies of DNA sequences gathered 
from populations. In this paper, we present some results concerning 
the cut vertices of S(S), i.e., vertices whose removal disconnect the 
graph, as well as its blocks or 2- connected components — results 
that yield, in particular, an intriguing generalization of the well- 
known fact that B{T?j is a tree if and only if any two splits in S are 
compatible. 
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1. Introduction 

Consider a finite set X of cardinality at least 2. We denote by A the 
complement X—A for any subset A of X. And we call a bipartition S = 
{A, B} into a proper non-empty subset Aof X and its complement B = 
A a split or, more specifically, an X- split. For any non-empty collection 
S of X-splits, we define the Buneman graph i3(S) = to 
be the graph whose vertex set consists of all maps from S into 

the power set V{X) of X that satisfy, for all S, S' G S, the following 
two conditions 

(BGl) (f){S) G S, i.e., if ^ = {A, B}, then (j){S) = A or (p{S) = B, and 
(BG2) <f){S) n (j){S') ^ 0. 

And we define its edge set -E(S) to consist of all those subsets {0, ip} 
of V^(S) for which the difference set A{(f),ip), defined by 

A(0,^):={5gS : <j>{S)^ij{S)}, 

has cardinality 1. 

We also denote by V*{'E) the superset of V^(S) consisting of all maps 
: S — )■ V{X) that just satisfy (BGl) (but not necessarily (BG2)). 
And we define the extended Buneman graph B*{T.) = (V* (T.) , E* (T.)) 
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to be the (necessarily connected) graph with vertex set and edge 

set £'*(S) defined exactly as -E(S) above, yet with replaced by 

V^*(E) in its definition. Note that is clearly isomorphic to an 

|E|-dimensional hypercube. 

An example of a Buneman graph is pictured in Figure [H Note that 
some of the vertices of the graph are labelled by elements in X. This 
arises from a canonical labelling map (fj::X-^ ^(S) '■ x ^-^ (px 
mapping X into the vertex set of where, for each x G X, 0^. 

denotes the map in V(T,) that associates, to any S E T,, that subset S{x) 
of X in S" that contains x. Clearly, this map is necessarily contained in 
In particular, the cardinality of V(T,) must be at least 2 for any 
non-empty collection S of X-splits. 
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Figure 1. For X := {1,2,..., 8}, the Buneman 
graph Bills) for the collection Eg of X-splits given 

by Eg := {5*13, 5*12, >S'i23, 5*1235; 'S'45, S'i234, 'S'eT, 5*78, S's} 

(where, for example, S'13 denotes the X-split 
{{1, 3}, {2, 4, 5, 6, 7, 8}}). The vertex is a cut- vertex of 

The Buneman graph has appeared in the literature in various guises: 
As a co-pair hypergraph in [H |2], as a special type of median graph in 
e.g. [3] (see also [20] for a review of median graphs), and in the above 
form in [12]. In addition, Buneman graphs and median networks are 
regularly used to help analyze viral or mitochondrial sequence data 
gathered from populations (see e.g. [^ fTO])- 

It is a well-known basic fact essentially established by Peter Buneman 
in 1971 (cf. [8]) that S(E) is a tree if and only if any two splits S, S' in 
E are compatible, i.e., if and only if, for any two splits S = {A, B}, S' = 
{A', B'} in E, one of the four intersections AnA',AnB',BnA',BnB' 
is empty. 

Here, we will generalize this fact in a rather natural way as follows: 
Given any split system E, let vro(E) denote the set of connected com- 
ponents of the so-called incompatibility graph r(E) of E, i.e., the graph 
with vertex set E whose edge set consists of all pairs of splits contained 
in E that are not compatible. For example, for Eg the collection of 
splits in Figure [H r(E8) consists of five cliques, viz {S'13, 5*12}, {'S'123}, 
{'S'1234, S'i235; 'S'45}, {SqtjSjs}, and {5*5}. 
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Further, let T(S) denote the (by construction necessarily bipartite) 
graph with vertex set V(S) the disjoint union of V(T,) and vro(S) and 
edge set SiT,) the set of all pairs {Eo,0} with Sq G 7ro(S),0 G V(T,), 
and 0(5') n 7^ for all S" G S — Eq and all Aq G for some — 
or, equivalently, every — split 5*0 G Sq. Then, this graph T(S) = 
(V(S),£^(S)) is always a tree (Theorem 16. ip . For example, the tree 
T{T,s) for the split system Eg referred to in Figure [1] is presented in 
Figure [2] (a) in the last section. 

To establish this fact, we will first introduce appropriate notations 
and discuss some generalities in Section [2l Then, in the next section, 
we will study the cut vertices of i3(S), that is, the vertices G V(T,) 
of -B(S) for which the induced subgraph B^'^\T.) of i3(S) with vertex 
set ViT,) — {(f)} is disconnected (see e.g. Figured]). Remarkably, these 
vertices can be characterized in quite a few equivalent ways (cf. The- 
orem UT]). In particular, given any map G V^(S), one can define as 
many as at least eight significantly distinct graphs all of which have the 
property that there is a canonical one-to-one correspondence between 
their connected components and the connected components of B^'^\T,). 
So, a map G V{T,) is a cut vertex if and only if any of these graphs 
is disconnected. 

In Section [5l we shall use the cut vertices to provide an explicit 
description of the blocks (or two-connected components) of -B(S) [9], 
that is, the maximal subsets V of V{T,) for which the graph induced on 
V is connected and does not contain a cut vertex. More specifically, we 
will show that there is a canonical one-to-one correspondence between 
7ro(E) and the set Bl{T,) of blocks of B(J2) given by associating, to each 
connected component Sq G vro(S) of r(S), the set -B(So) consisting of 
all maps G V{T,) with {Eo,0} G S{T,), see Theorem 15.11 for details. 
Thus, for example, we see that in Figure [1] the Buneman graph has 5 
blocks that correspond to the 5 cliques in the associated incompatibility 
graph. 

And finally, in the last section, we will establish the above-mentioned 
generalization of Peter Buneman's result from 1971. In addition, we will 
establish some refinements that will allow us to associate, to any split 
system S, a "proper" X-tree T^, i.e., a triple {Vs, E^; ^t,) consisting 
of a tree with vertex set Vs and edge set E^:, and a labelling map 
^j: : X Vs such that the degree of any vertex in Vs that is not 
contained in the image of $e is at least 3. For example, the tree 
for the split system Eg in Figured] is presented in Figure [2] (b). 

Note that besides providing important structural information con- 
cerning the Buneman graph (and median networks — see e.g. [7]), cut 
vertices have been used to help in the computation of most parsimo- 
nious trees for DNA-sequence data (see e.g. [U El [22])- And the blocks 
of B{T,) determined by them are also closely related to the "blobs" de- 
fined in terms of directed graphs described by D. Gusfield et al. in [18] 
(see also e.g. [lUdn]). 
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In [15], we will use the results described above to show that -B(S) 
can be described as a push-out in terms of the data involved in the 
construction of T(S), and discuss further potential applications. 



In this section, we will review some results concerning the Buneman 
graph that will be needed later on. Wherever appropriate, we will refer 
the reader to the relevant literature for proofs of the results that we 
state. We also refer the reader to [9] for the basic terminology of graph 
theory that we will use throughout this paper. 

First, we define, for every map G V*(T,) and every subset H of S, 
the map 0^ G V*(T,) by putting 



for every split 5 G S. Note that, writing 0"^ rather than 0^ in case 
S consists of a single split S, only, one has {4>,ip} G £'*(E) for some 
0, "0 G if and only if -0 = 0"^ holds for some (necessarily unique) 

split 5* G S, f zz the unique split S = S^^^ in A(0, -0). Note also that, 
for a fixed map G V*{Y?) and any other map if) G one clearly 

has 0^('^'^) = if). In particular, if) is completely determined by and 
the difference set A(0, ^/i). 

Using other notations and arguments, the following result has also (at 
least implicitly) been shown in, e.g., [211 Chapter 3.8] and [2l Chapter 
5.1]. It is also related to [HI Theorem 1]. 

Lemma 2.1. Given a vertex in V{T?) and a split 5 G S, the following 
three assertions are equivalent: 

(i) 0(5*) is contained in the seimin(0[S]) of inclusion-minimal sub- 
sets in the image 0[S] o/S relative to cf), 

(ii) the map 0'^ G V*(S) is, in fact, a vertex in V{T,) and thus 
forms, together with 0, an edge in E{T?), 

(iii) there exists some vertexip in V{T,) with(f){S) G min(0[A(0, -0)]). 

Proof: (i) =^ (ii): Suppose 5* G S with 0(5") G min(0[S]). To see that 
0"^ G V(Tj) holds, note that, by definition, we have 0'^(5'i) fl 0'^(5'2) = 
0(51) n 0(5*2) 7^ for all 5*1, 5*2 G S that are distinct from S. Moreover, 
we have 4>^{S') n 0^(5) = 0(5') n 0(5) ^ for all 5' G S - {5} 
as 0(5") n 0(5) = would imply 0(5') C 0(5) in contradiction to 
0(^) G min(0[S]). 

(ii) =^ (iii): This is trivial: Just put ip := 0"^. 

(iii) =^ (i): This follows immediately from the following, slightly more 
general observation: 

Lemma 2.2. Given any two vertices (f),ip in -B(S), the set 0[A(0, -0)] 
is an ideal in the partially ordered set 0[S], that is, Ai,A2 G 0[S], 
Ai G (f)[A{(f),ip)], and A2 C Ai implies A2 G 0[A(0,?/')]. 



2. Preliminaries 
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Proof: Indeed, denoting by Si and 5*2 the two splits in S with 0(5*1) = 
Ai and 4>{S2) = A2, respectively, we must have 5*1 G A{(f),ip). Thus, 
our assumption A2 Ai = (p{Si) implies = fl 0(5*1) =742 1"! ip{Si) 
and, hence, '0(5'2) A2 = 0(52), i.e., 52 G A(0, -0), as claimed. 

This finishes also the proof of Lemma 12.11 1 

Next, note that, by definition, i3(S) is clearly the induced subgraph 
of i3*(S) with vertex set V^(S), and the graph-theoretical distance 
D*{ip,%p') between any two vertices ip and ip' in i5*(S) coincides with 
the cardinality of their difference set A('0,'0'). 

It follows that i3*(S) is, in particular, a median graph. I.e., there ex- 
ists, for any three maps 0i, 02, 03 G V^*(S), a unique map me(i(0i, 02, 03) 
in l^*(E), dubbed the median of 0i, 02, and 03, that lies simultaneously 
on (i) a shortest path joining 0i to 02, (ii) a shortest path joining 0i to 
03, and (iii) a shortest path joining 02 to 03. It maps every split 5* G S 
onto 



(1) merf(0i,02,03)(5') 




if 0i(5)G {02(5), 03(5)}, 
otherwise. 



Parts of the following corollary have also been observed in, e.g., [H 
Section 2.1], |1, Proposition 1], 0, and [21] : 

Corollary 2.3. The Buneman graph i3(S) is, for every split system S, 
a connected isometric and median subgraph of the extended Buneman 
graph i3*(S) associated with S. That is: 

- The median me(i(0i, 02, 03) of any three maps 0i,02,03 G ^(S) is 
also contained in V{I1). 

- The graph-theoretical distance D{iIj,iIj') between any two vertices i(j 
and ip' in 'S(S) is finite and coincides with the distance D*{iI),iIj') be- 
tween ip and ip' in the larger graph S*(S).- I.e., there exists, for any two 
vertices ip, "0' £ ^(S); sequence ipo '■= ■ ■ ■ y'lpk '■= "0' of maps in 
K(S) of length k := |A('0,'0')| such that \A{ipi_i,ipi)\ = 1 holds for all 
i = 1, . . . , k. 

More specifically, there exists a canonical one-to-one correspondence 
between the set consisting of all such sequences ipo = ■ ■ ■ , '0|A(V',V'')I " 

ip' and the set of all linear orders defined on '0[A(-0, -0')] that ex- 
tend the partial order of tp[A{ip,tp')] defined by set inclusion. 

Furthermore, the following holds: 

(i) Given any three vertices ■ip,ip',(p ^ ^(^); every shortest path 
■00 := "015 • • • 5 0'|A(V',V'')l connecting ip and ip' in i3(S) 

must pass through if and only if one has A{ip, ip') = A{ip, 0) U 
A(0, -0') and ip{S) C 0^(5") for every S G A(-0,0) and 5' G 
A(0, -0'). And, conversely, there exists a shortest path ipo : = 
■0, -01, ... , '0|A(V',V'')l V^' connecting ip and ip' in i3(E) not pass- 
ing through if and only if there exist splits S G A{ip,(p) and 
S' G A{iP', 0) with 0(5) U 0(S') ^ X. 
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(ii) Any cycle {V'o, V^i}, {i'l, ^2}, • • • , {^n-i, i'n ■= i'o} in -E(S) that 
is not the "mod-2 sum" - or symmetric difference - of cycles of 
smaller length, is of length 4: (as this is easily seen to hold for 
any median graph). 

Proof: As, in view of ([T]), there exists, for all (/)2, </)3 G V*{Tj) and all 
5*, 5" e E, an index i G {1,2,3} with med{(f)i, (j)2, 4>3){S) = (pi{S) and 
me(i((/)i, (/)2, 03)(5") = (pi{S'), it is obvious that med^cpi, (f)2, (ps) G V(T,) 
holds for any three maps 0i, 02, 03 G V(T,). 

Further, with '0,'0' G V^(S) as above, select a split S G A('0,'0') such 
that '0(5') G min-0[A('0, -0')] holds, and put ipi '■= i^^- Clearly, 0i G 
nS), M0i} G E{E), A(^,0.i) = {^}, and A(0^i,^') = A(0,0') - 
{S*} and, therefore, -01 [A (-01, -0')] = '0[A('0, ■0')] — {■0(5')} also holds. 
Thus, in view of Lemma I2.H the first assertion in the statement of the 
corollary follows easily using induction with respect to |A('0,'0')|. 

The remaining assertions now follow from this assertion: It implies 
that, given any three vertices ip, G ^(S), the following two claims 
are equivalent: 

(i) Every shortest path i/jq := 0, 0i, . . . , 0|a(V',V'')I connecting 
and ip' must pass through 0. 

(ii) Both A(0,0^') = A(0,0) U A(0,0') and 0(5) ^ ^{S') hold, for 
all S G A(0, -0) and 5" G A(0, -0'), for every linear order defined 
on ip[A{'ip,ip')] that extends the partial order of 0[A(0, 0;)] defined by 
set inclusion. I 

Next, we associate to every edge e = {0,0} G E*(T,) its type ^(e) 
that we define to be the unique split ^(e) := S^^^ in A(0, 0;). This 
clearly yields a surjective map k, : E*(T,) — t- S whose restriction to 
-E'(S) we denote by ke. 

For example, the 4 horizontal edges of the small cube in the middle 
of Figure [Hall are of type S'1234, and the 4 parallel edges in that cube 
going from SW to NE are all of type 5*1235. 

Clearly, one has A(0,0) = {K({0j_i, 0j}) : i = 1, . . . , |A(0, 0^)|} for 
every shortest path 0o := 0, 0i, . . . , 0|a(</>,'!/))| := 0^ from a vertex to a 
vertex ip in -B(S) or i3*(S). We claim (see also pTl Exercise 11, p. 64]) 

Lemma 2.4. The set k^^{S) is, for every split S = {A, B} G S, a 
cutset of i3(S) "inducing" the split S . That is, removing the edges 
in K^^(S) from i3(S) yields a subgraph with precisely two connected 
components, one denoted by B(Tj\A) containing all the vertices 0a with 
a & A, and the other one denoted by B{Tj\B) containing all the vertices 
0f, with h & B. Furthermore, the same holds (mutatis mutandi) for 
B*{T?j and the subset k"^{S) of E*{1:). 

Proof: Let S = {A, B} be an arbitrary split in S. It suffices to show 
that the two subsets i3(S|A) := {0 G ^(S) : 0(S) = A} D {0^ : a G A} 
and i3(S|S) := {0 G V(S) : 0(5) = B} D {(^b : b e B} of V(S) are 
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(exactly the two) connected components of the subgraph of i3(S) ob- 
tained by removing the edges in k^^{S). Indeed, it follows immediately 
from the definition of E{I1) that every path in from a vertex in 

;B(E|74) to a vertex in B{T,\B) must contain at least one edge in k^^{S). 
Moreover, in view of Corollary I2.3[ no shortest path between any two 
vertices in (and, analogously, in B(T,\B)), can pass through an 

edge in k"^(S'). And, clearly, the same argument works just as well for 

Note that restricting the maps in V{I1) to a given subset S' of 
S clearly induces a surjective graph morphism from to 
That is, it yields a surjective map from l^*(E) onto that maps 

every edge e in £^*(E) either onto a single vertex (if K{e) G S — S') in 
V*{T,') or onto an edge in E*(J2') (if ^(e) e S'). We now show that 
it also induces a surjective graph morphism ress^-s' from i3(S) onto 
-B(S') (see also [12]): 

Lemma 2.5. Given a subset S' of a split system S on X , and a map 
ip G V(S'), there exists {at least) one extension ip of ip in V(Tj), i.e., a 
map (f) E V{T.) with = ^. 

Moreover, the resulting surjective graph morphism ress-j^s' contracts 
every edge e G -E(S) that is of type K,{e) G S — S' onto a vertex while 
it maps every edge e G -E'(E) of type K,{e) G S' onto an edge e G E{T,') 
of the same type. 

Proof: Using induction with respect to we may assume, without loss 
of generality, that S = S' U {S} holds for some single split S = {A, B} 
in S — E'. Then, at least one of the two extensions ipA,'4'B of ip in 
V*(T,) defined by putting '0^(5') := A and ipsiS) := B must be con- 
tained in V(T,): Indeed, if ipA ^ V*(T,) would hold, there would exist 
some S' G S' with ipAiS) fl V'a(5") = An ipiS') = 0. In this case, how- 
ever, '^B{S)niljBiS") = Bn'pj{S") would hold for all S" G S' since 
Br\ij{S") = for some S" G S' would imply '^{S')nilj{S") C BnA = 
in contradiction to E V{T,'). So, ^ipB ^ V*{T,) must hold in this case, i 

Corollary 2.6. Given any two distinct splits S, S' in a split system E, 
there exist always two maps 'p,'p' G V"(S) with S, S' G /\{'p,-p'). 

Proof: This follows directly from the last lemma as it obviously holds 
in case S = {5, 5"}. I 



3. Some Graph-Theoretical Observations 

To establish our main results, we will make use of the following simple 
and purely graph-theoretical observations: 
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Suppose that U and V are two sets and that R U x V is a binary 
relation. Let T{R) denote the bipartite graph with vertex set the "dis- 
joint amalgamation"0 UU.V of U and V, and edge set E{R) := v} : 
u E U,v G V, {u,v) G R], and define graphs T{R\U) := {U, [{ui,U2} E 



Q : 3,^v{u,,v),{u2,v) e R}) and T{R\V) := {V,{{v,,V2} G Q : 



^uGu{u,Vi), {u,V2) G R}). Let TioiR), 7ro(-R|f/) and 7ro(-R|V^) denote 
the connected components of T{R), T{R\U) and T{R\V), respectively. 
Then, the following holds: 

Lemma 3.1. Given two sets U and V and a binary relation R U xV 
such that the associated bipartite graph T{R) has no isolated vertices, 
the embeddings ljj : U U 11V : u ^ u and Ly '■ V ^ U 11V : v ^ v 
induce bijections between the sets 7io{R\U) and no{R\V) and the set 
71q{R) of connected components of the graph T{R). I.e., with obvious 
notational conventions, we have a commutative diagram of bijections: 



In particular, given two subsets A G 71q{R\U) and B G 7ro(-R|l^) ofU 
and V , respectively, the following assertions are equivalent: 

(i) tiq{U ^ V){A) = B, 

(ii) {Ax B)r]R^%, 

(iii) B = {veV ■.^aeA{a,v)ER}, 

(iv) A = {MGt/:3feeB(M,6) Gi?}. 

Proof: All this is quite obvious: The maps 7ro(<.;7) and nQ^Lv) must be 
surjective as T{R) is supposed to have no isolated vertices. And they 
must be injective because any path in T{R) connecting two vertices 
Ui,U2 G U (or fi,f2 G V) gives rise to a path connecting these two 
vertices in V{R\U) (or in V{R\V), respectively). i 

Now, assume that U' and V are two further sets, and that a : U' U 
and [3 -.V ^ V are two maps such that there exists 

(Ml) for all Ui,M2 G U and v eV with (mi, t>), (m2, t>) G R, some 
v' G V with (Mi,/3(f')), {u2,l3{v')) G R and, symmetrically, 

(M2) for all M G ?7 and fi,f2 ^ V with (u, t>i), (m, t>2) G R, some 

u' G U' with («(«'), fi), («(«'), f 2) G R. 
Then, defining the binary relations 



Here, we make use of the fact that, according to category theory, the disjoint 
amalgamation of any two sets U and V is weU defined up to canonical bijection 
whether or not U and V are disjoint, and can be constructed e.g. by considering 
the subset of the set (C/UF)x{l,2} consisting of ah {w,i) G (C/ x {!}) U (V^ x {2}). 



Tl,{R\V) 





MR) 



R' ■= ■= {{u',v') eU' xV : {a{u'),f3{v')) G R} 
R^ ■= {{u',v) eU' xV : {a{u'),v) G R}, 
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and 

Rp := {{u,v') eU yiV : {u,(5{v)) G i?}, 
it is easily seen that the following holds: 

- The graph V{Ra\V) coincides with the graph V{R\V). 

- The graph r(i?^|?7) coincides with the graph V{R\U). 

- The graph T{Ra\U') coincides with T{R'\U') as well as with the 
graph induced by V{R\U) and a on U' . 

- The graph r(i?^| V) coincides with V{R'\V') as well as with the 
graph induced by V{R\V) and {3 on V . 

Furthermore, all the corresponding maps must induce bijections on the 
level of connected components, i.e., we have 

Corollary 3.2. Continuing with the assumptions introduced above as 
well as in Lemma lSTll and using obvious notational conventions, we have 
the following commutative diagram in which all maps are bijections: 



MR'\u' 




MR\u) 



7ro(C/'-i?') 

^' vro(i?' 




MR) 



7ro(f7<™»y) 




MR\v) 



In particular, given four subsets A' G 7ro(-R'|f/'), A G 7ro(-R|t/), 
B' G no{R'\V'), and B G no{R\V) ofU',U,V' and V, respectively, 
the following holds: 

(i) 7ro(f/' ^ V')iA') = B' ^ {A' X B') n i?' ^ ^ B' = 
{v'eV':3a'eA'{a',v')eR'} 

(ii) 7ro(a)(A') = A ^ a{A') n A ^ ds ^ a{A') C A ^ 
A = {u E U : 3a'(zA',v<^v{u,v) G R and (a',t>) G Ra} 

(iii) (7ro(/3)o7ro(f/' ^ = ^ ^ x fi)ni? ^ ^ 
B = {veV ■.3a'eA'{a{a'),v)eR} 

Proof: It follows from Lemma 13.11 that all non-vertical maps in the 
above diagram must be bijections. Moreover, if either a or /3 is the 
identity map, at least one of the vertical maps must also be a bijection 
in which case all maps must be bijections. So, the general case follows 
by concatenating the diagram obtained for the pair (idc/,/3) with the 
diagram obtained for the pair (a,idy') and applying what we know 
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about the individual binary relations R',Ra, and R. 



4. Some Characterizations of Cut Vertices 

In this section, we will provide some characterizations of the cut 
vertices in the Buneman graph. Some of these closely resemble the 
characterization of cut points in the tight span of a metric space in 
terms of certain finite graphs given in [13]. More precisely, we will 
establish the following result: 

Theorem 4.1. Assume as above that X is a finite set, that H is a 
system of X -splits, and that (p is a map in V"(S). Then, the following 
assertions are equivalent: 

(i) (j) is a cut vertex ofBiJl). 

(ii) There exists a hipartition of'Z^'^'^ := {S E : 0(5") G min(0[E])} 
into two disjoint non-empty subsets Tj^f^ and Tj''^'^ such that any 
two splits Si G S^"^'' and S2 G T,^^^ are compatible. 

Ill has a non-empty intersection with at least two distinct con- 

nected components of the incompatibility graph r(S) of S, as 
defined in the introduction. 

(iv) There exists a bipartition of 11 into two disjoint non-empty sub- 
sets Si and S2 such that 0(5*1) U <f){S2) = X holds for any two 
splits Si G El and 5*2 G S2. 

(v) There exists a bipartition of X^'^^ := {x G X : 7^ 0^} into two 
disjoint non-empty subsets Xi and X2 such that Xi C 0(5') or 
X2 C 0(5) holds for all S E T, or, equivalently, for all 5 G S^*^-* . 

(vi) There exists a bipartition ofV^'^^ = V^'^^T,) := V{T,) — {0} into 
two disjoint non-empty subsets Vi and V2 such that A(0, -01) fl 
A(0, -02) = holds for all ipi G Vi and ip2 & V2- 

To establish Theorem 14.11 we apply Corollary 13.21 as follows: With 
X, E, and as in the theorem, we put [/ := E, 1/ := V^^\ U' := S^"^), 
V := X^^\ Further, we denote by a the embedding of T,^'^^ into S 
and by (3 the map X^"^) V^(<^) : x ^ (p^. And we put R := R^^^ : = 
{{S,ij) G S X : S G A(0,V^)}. 

It is easily checked that all the requirements needed for applying 
Corollary 13.21 are satisfied. For example, the maps a and /3 satisfy 
Properties (Ml) and (M2): Indeed, given two maps "01, "02 £ V^'^\ and 
some split 5 G S with 5 G A(0, -01) fl A(0, -02), there exists a split in 
S' G with S' G A(0,0;i) n A(0,0;2) as 0i(S) = 02(^) 7^ 0(5) 3 
(f){S') for some S,S' eJ: implies 0(5') n ^i{S) = 0(5') n MS) = 
and, hence, MS') = MS') 7^ 0(5'). And if 5,5' G A(0,^) holds 
for some 5, 5' G S and ip E V'^^\ then ip{S) n ip{S') ^ implies that 
there is some x E X^'^^ with 5", 5' G A(0, 0^.) as this must hold for any 
x E ij{S)nij{S'). 
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Now, with W denoting any of the sets T,^'^\X^'^\ S, and V^'^^ or their 
cartesian products x X(<^),S(<^) x x X^^\ or S x I/W, let 

Tii,{W) denote the corresponding graph with vertex set W whose edge 
set E^{W) is defined in terms of the binary relations R''^\ rI^\ R^i^\ 
and i?^^^ - so, for example, the edge set E^CE) of T^CE) is the set 

E^iE) = {{S^,S2} E ■ 3v,gy(« ^1,^2 e A(0,V^)}. 

Note that in what comes below, we shall derive explicit descriptions of 
the edge sets of the graphs r<^(S), r^(SW), r^(X('^)), and T^{V^^^) in 
dSD, ©, dl]), and ([5]), respectively. It may be helpful for the reader to 
take a look at these explicit descriptions before proceeding. 

In addition, to further simplify notation, put ■Ti(f){W) := 7ro(T^{W)) 
and, for any pair of distinct vertex sets W, W as above, denote by 
n^(W' -w W) the induced bijection from 7i^{W') onto iiipiW). 

Then, Corollary 13.21 yields the following diagram of canonical bijec- 
tions: 




Now, note that the graph T^{V^'^^) contains the induced subgraph 
of -B(S) = with vertex set l/^"^) — the graph that, 

by definition, is disconnected if and only if is a cut vertex (of B(T,)): 
Indeed, any edge {■ilj,ip'} G E(E) with ifj^xfj' ^ cf) must also be an edge 
in r0(VW) as {V',^'} e E{E) implies that either A(V^,0) C A(^',0) 
or A(-?/'', 0) C A('?/', 0) must hold. So, ipyip' ^ 4> implies that some split 
S e A(^, 0) n A(^', 0) must exist. 

Furthermore, the embedding of B^'^\T,) into r^(V^'''^'') induces a bijec- 
tion between the corresponding sets of connected components ttq (^B^^^ (S)) 
and 7r<^(V('^)) (the latter being the set in the bottom right corner of the 
above commutative diagram): Indeed, given any two maps ip, -0' G V^'^^ 
that form an edge in T(i,(y^'^^), there must exist some split S E A(-0, 0)n 
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A('?/^', 0) implying that S G A{ipi, 0) and, hence, that also ipi ^ (p must 
hold for every map ipi in any shortest path ipQ := . . . ,ipk '■= 

from ip to ip' in Therefore, ip and '0' must also be contained in the 

same connected component of B'^'^\YP). In consequence, any connected 
component of T(i,{y^'^'^) must be contained in and, hence, coincide with 
connected component of B'^'^\Tj). 

Thus, in view of the above diagram, a map G V(T?) is a cut vertex 
of -B(S) if and only if either one of the eight graphs T^{W) with W = 
SW, XW, S, X X^-^), S^-^) X l/W, S X X^"^), or S X F^'^) as above 

is disconnected. 

So, denoting the connected component of Tfi,{W) containing a given 
vertex w & W hj FfplW, w), Theorem 14 . 1 1 follows immediately from the 
following observations: 

(i) <^=^ (iv): The definition of the edge set E^CE) implies that two 
distinct splits S", S" G S form an edge in E^^E) if and only if there is 
some X G (p{S) ft 4>{S'). I.e., we have 

(2) = {{5, S'} G ■■ <f>iS) U 0(^0 ^ X}. 

So, r^(E) is disconnected if and only if there exists a bipartition of S 
into two disjoint non-empty subsets Si and S2 such that 0(S'i)U0(S'2) = 
X holds for any two splits 5*1 G Si and 5*2 G S2. This establishes the 
equivalence of (i) and (iv) in Theorem 14. 1[ 

It follows also that the incompatibility graph r(E) of S is a sub- 
graph of r<^(Il). So, r<^(S) must be connected for every map G V{E) 
whenever the incompatibility graph r(S) is connected. In consequence, 
B^'^\T.) must be 2-connected in this case, and, more generally, ev- 
ery connected component in 7ro(S) = 7ro(r(S)) of r(S) must be con- 
tained in a connected component of r<^(E). Equivalently, every con- 
nected component of T^(T,) is a disjoint union of connected compo- 
nents of r(E). The corresponding canonical surjection from 7ro(E) onto 
7r<^(E) = 7ro(r0(E)) that maps any connected component r(E,S') of 
r(E) containing a split S* G E onto the corresponding connected com- 
ponent r0(E,5') of T^{T,) containing S will henceforth be denoted by 

(i)<^==^(ii): This follows immediately from the following observation: 

Lemma 4.2. With X, E, and as above, two splits S, S' G E*^*^) form 
an edge in the induced subgraph r<^(E'^'^)) ofT^{T.) with vertex set E^*^) 
if and only if they are incompatible. I.e., we have 

(3) ^^(E^'^)) = {{S,S'} e i 2 j ■■ S andS' are incompatible}. 

Proof of Lemma: Indeed, if two distinct S, S' G E*^"^) are incompatible, 
they form an edge in T^(T,). Conversely, if they form an edge in r0(E), 
i.e., if 0(5") n 4>{S') 7^ holds, they must be incompatible. This follows 



BLOCKS AND CUT VERTICES OF THE BUNEMAN GRAPH 13 

as 0(5') n 0(5") 7^ holds for any two splits 5', 5" G S in view of 
G V{T,), and 0(5) fl 0(5') 7^ holds for any two distinct splits 
5 G and 5' G S. I 



(i) <^==^ (iii): Lemma [4.21 implies that T(i,('Z^'^^) can be viewed as the 
induced subgraph of the incompatibility graph r(E) with vertex set 
j^W c E. This implies that the embedding S^*^^ — )■ E induces a well- 
defined map 7r<^(E('^)) ^ 7ro(E) : r<^(E('^), 5) r(E, 5) that we denote, 
for short, by 7Cfp{>-^). Moreover, the composition 

vr0(^) o 7r^(^) : 7r^(E('^)) ^ 7ro(E) ^ 7r^(E) 

maps, for any split 5* G the connected component r0(E('^\ 5") onto 
the connected component r0(E, 5') and, hence, coincides with the bijec- 
tion 7r<^(E(<^) ^ E) : T^{J:^'^\S) ^ T^{J:, S) induced by the embedding 
a of E^*^) into E. In turn, this implies 

Lemma 4.3. The map tt^{>-*) is always injective, that is, any two splits 
5, 5' G E^*^^ can be connected by a sequence of pairwise incompatible 
splits in E^*^) whenever they can be connected by such a sequence of 
pairwise incompatible splits in E. 

This clearly establishes that (i)<^=^(iii) holds, as claimed. 

(i) <^=^ (v) <^==^ (vi): We now establish the equivalence of the first 
and the last two assertions in Theorem 14.11 It suffices to note that two 
elements xi, X2 in are not connected by an edge in T(i,{X^'^^) if and 
only if xi G 0(5") or X2 G 0(5") holds for all 5" G E or, equivalently, for 
all 5 G E^-^). I.e., we have 

(4) i?^(X(^)) 

= {{Xi,X2}ei 2 )• 35eS rorSW); ^1^2:2 ^0(^)} 

= {{xi,X2}ei 2 j : ^(0x.i, 0^2) < ^(0X1,0) + ^(0,0^2) }• 

Moreover, two maps ipi,'4'2 in V^^^ are not connected by an edge in 
T^{V^^^) if and only if 0(^) = i^i^S) or 0(5) = ^2(^) holds for all 
5 G E or, equivalently, for all 5 G E*^*^). I.e., we have 

(5) E^(\/W) 

= {{^1,^2} ^[ 2 )'■ ^^^^ (-^''') ^^^^^ = ^'^^^ ^ '^^^^^^ 
This finishes the proof of Theorem 14.11 I 
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It is worth noting in this context that our approach imphes also that, 
given a map cf) G V^(S), the following assertions are equivalent: 

(i) is a cut vertex. 

(ii) There exists a bipartition {Si, S2} of S and a bipartition {Vi, V2} 
of V^^^ such that 0(^) = i){S) holds for all 5 G Si and ^ G V2, 
and for all S' G S2 and ip &Vi. 

(iii) There exists a bipartition {Si,S2} of S'-'^-' and a bipartition 
{Xi,X2} of X^^^ such that x G holds for all 5 G Si and 
X G X2, and for all S* G S2 and x G Xi. 

For example, for the map in Figured! the bipartition of Sg is {{S'e?, 5*78}, S— 
{S'67, 5'78}}. Moreover, we have S(<^) = {S'1235, 5'i234 , 5*45 , 5'78, S'67}, and 
the corresponding bipartition is given by {{5'67, 5'78}, S^*^) — {S'67, 5'78}}. 

It is also worth noting that the various images of the connected com- 
ponents in the sets ti^{...) relative to the respective bijections considered 
above can be described as follows. For (f),ip E K*(S), we put 

Ar^Um := {S G A(0,^) : ^(S) G min(^[A(0, V^)])}, 

where min(?/^ [A (0, ?/')]) denotes the set of (inclusion-) minimal subsets 
in the image '?/'[A(0, ■?/;)] of A(0, -0) relative to ip. 

Proposition 4.4. Given any four connected components Sq G 'k^(Tj^'^^), 
So G 7r0(S), Xo G 7r<^(XW), and Vq G tt^{V^^'^), the following holds: 

(i) 7r<^(SW ^ S)(S^) = So ^ n ^ So ^ S^ = So n 
SW ^ So = {5 G S : 4>{S) U 0(5') ^ X for some S' G S^,}. 



m 



7r^(SW ^ XW)(S^) = Xo ^ Xo = {x G X^ : A^in(0|0.) C 
Sa ^ Xo = {x G XW : A^in(0|0.) n S^ 7^ 0} ^ S[, = 



*min 



IV 



VI 



7r<^(SW ^ F(^))(S^) = K) ^ K) = G V^'f'^ : A^i„(0|^) C 
So) ^ K, = G : A^in(0|V^) n S^ 7^ 0} ^ Vo ^ 
{0^ : 5 6 S;,} <^ K = U^evo ^min(0|V^) ^ = {S G 
SW : 0'5 G 1/0}. 

7r^(S ^ XW)(So) = Xo ^ Xo = {x G X^ : A(0,0,) C 
So} ^ Xo = {x G XW : Sq n A(0,0,) ^ 0} ^ So = 

7r<^(S <^ F(^))(So) = ^ Vo = {^IJ e : A(0,^) C 

So} ^ = {V^ G : A(0,V^) n So 7^ 0} ^ So = 
U^,^,A(0,^). 

7r<^(XW ^ VW)(Xo) = K) ^ Vo = {^e : /^(^,0.) < 
D{ip, 0) + D(0, 0^) for some x G Xq} ^o = e V^^"^) : 

A(0, 0^.) n A(0, V^) ^ /or some x G Xq} Xo = {x G X : 

0. G ^}. 
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These assertions follow quite easily from our definitions. We leave 
their simple and straight-forward (yet sometimes a bit laborious) veri- 
fication to the interested reader. 

Note also that, continuing with the assumptions and notations of 
Theorem 14. our analysis implies the following corollary: 

Corollary 4.5. The graph B{T,) is 2-connected if and only if the in- 
compatibility graph r(S) is connected. 

More generally, given any two maps tp, ip' E V{T.), one has T^{V^'^\ ip) 
= r^(rW, V^') for all (f) e - iP'} if and only if A(^, ^p') C Sq 

holds for some connected component Sq of the incompatibility graph 
r(S) of H. So, conversely, two maps ip,ip' € V^(S) are separated by 
some cut vertex (p G V{TP) if and only if A^ipjip') has a non-empty 
intersection with at least two distinct connected components o/r(E). 

Proof: We have already seen above that cannot contain a cut 

vertex in case r(S) is connected. 

Conversely, if r(E) is not connected, one may choose any two splits 
S, S' in distinct connected components of r(E) and then, according to 
Corollary 12. 6[ two maps ipjip' G V"(S) with S,S' G Alipjip'). Then, 
given any shortest path ipQ := ip, ipi, . . . , tp^ := tp' from tp to ip' in 
there must exist some i in {1, . . . ,k — 1} such that the two splits in 
the one-split sets A{'ipi,ipi_i) and A{ipi,ipi+i) are in distinct connected 
components of r(E). Thus, ipt must be a cut vertex in 

Moreover, if A{ip,ip') C Sq holds for some connected component Sq 
of the incompatibility graph r(S) and is any map in — {ip, ip'}^ 
we must have T^{V'^'l'\ip) = T^{V^'t'\'ip'): Indeed, Proposition (v) 
implies that r^(V^W,7/^) ^ r<^(VW,^') holds for some maps ip,ip' G 
V^'^^ if and only if the two sets A{ip,(l)) and A{(p,-ip') are contained in 
two distinct connected components of r<^(E). In turn, this implies that 
A{ip, Ip') must coincide with the disjoint union of A{ip, (p) and A(0, ip'). 
So, A{ip,ip') cannot be contained in a single connected component of 
r^(E) and, hence, even less in a single connected component of r(S) 
in this case. Thus, given any two maps ip,ip' G V{TP), V ^{y^'^\ip) = 
^<t>i^^'^\'4'') must hold indeed for any map (p in V{T,) — {ip, ip'} in case 
A{ip,ip') C Eq holds for some connected component Sq of the incom- 
patibility graph r(E). 

Conversely, if r^(yW,z^) = T^{V^^\iP') holds for all cp G - 
{ip,ip'}, we must have A{ip,ip') C Eq for some connected component 
Eq of the incompatibility graph r(S). Indeed, choosing as above any 
path ipQ := ip,ipi, . . . ,ipk ■= Ip' from ip to ip' in there would 

otherwise exist some i in {1, . . . , k — 1} such that the two splits in 
the one-split sets A{ipi,ipi_i) and A{ipi,ipi+i) are in distinct connected 
components of r(E). In turn, this implies that, for (p := ipi, we would 
have T fi,(V^'^\ ipi-i) ^ T^{V^'^\ipi+i) in view of Proposition H3](iii), and 
the fact (cf. Lemma [4.31) that distinct connected components of r(S) 
intersect S*^"^) in distinct connected components of r^(E'^"'^)) and, hence. 
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r^(VW,^) = r^(yW,^,_i) 7^ r^(yW,^,+i) = r^(y(^),^'), a contra- 
diction. I 

To conclude this section, we note that, essentially by definition, 
{S,(p^) e R^'^^ holds for every e V{E) and all S G S^'^). Thus, 
the obviously well-defined and injective map 7 : T,^'^^ — >■ F^^^ : S ^-^ (f)^ 
necessarily induces a map 710(7) from 7r^(E('^)) into 7r^{V'^'^^) that must 
coincide with the bijection 7r<^(E''''^^ K'-'^-'). So, wc have a diagram of 
bijections all of which arc "induced" by naturally defined maps between 
the corresponding vertex sets: 



7r^(SW^E) 



TTO (7) 



So, once established, these bijections could also have been used to de- 
fine all the other bijections between the sets 7r^{T,^'^^),7r^(T,),7r^{X^^^), 
and 7r^{V^'^^) which, however, we feel would have led to a much less 
transparent and natural approach. 



5. Blocks of the Buneman graph 

For a collection E of X-splits, we denote by c-ut(S) the set of all cut 
vertices of B{'E) and, as mentioned in the introduction, by -B/(E) the 
set of all of its blocks. In this section, we will describe a canonical 
bijection between Bl{T,) and 7ro(E), the set of connected components 
of the incompatibility graph of E. 

To this end, note first that, given any two distinct compatible X- 
splits S and S', there exists a unique subset A E S that we denote by 
A{S\S') such that An and An or, equivalently. A' G A 
or B' C A holds for the two subsets A', B' in 5". Clearly, given A e S 
and A' G S', we have 

(4.1) AUA' ^X ^ A = A{S\S') and A' = A{S'\S). 

(4.2) A' gA <^ A = A{S\S') and A' A{S'\S). 

(4.3) Ac A' <^ A ^ A{S\S') and A' = A{S'\S). 

(4.4) A'r)A^$ <^ A^ A{S\S') and A' ^ A{S'\S). 

(4.5) If G y(E) and S G E^, then 0(5') = A{S'\S) must hold 
for every spht 5" G E that is compatible with S. 



Proof: Just apply (4.2) and (4.4) to A := <j){S) and A' := 0(5'), 
noting that neither A' <Z A nor A' r\ A — ^ can hold. I 
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Similarly, suppose that S, S', S" are three distinct X-splits such that 
S is compatible to S' and to S". Then, as both A(S'\S) and A(S"\ 
S) properly contain one of the two sets in S, we have 

(4.6) A{S'\S) n A{S"\S) ^ 0, and 

(4.7) A{S\S') = A{S\S") whenever S' and S" are incompatible. 

Proof: A{S \ S') ^ A{S \ S") would imply A{S \ S") C 
A{S' \ S) (apply (4.3) with A := A{S \ S")) and hence, by 
symmetry, also A{S \ S') C A{S" \ S). Thus X = A{S \ 
S") UA{S\S') C A{S' \S)U A{S" \ S), implying that S' 
and S" must be compatible, a contradiction. i 

Thus, given any connected component Eq of the incompatibility 
graph r(S) of S and any split S* G S — Eq, the two subsets A(S'\S") 
and A{S \ S") in S are well-defined and coincide for any two splits 
6", S" e Sq. In consequence, we will also write ^4(5' \ Eq) for this 
subset in S. 

Now, with E a collection of X-splits as above, consider a connected 
component Eq G vro(E) of the incompatibility graph r(E) of E and 
associate, to any map 4> e V*(Eo), the map : E ^ ^(^) defined by 
putting 

(l){S) use Eo, 
A(5\Eo) else. 
We claim 



4>iS) :- 



Theorem 5.1. With E and Eq as above, we have G V{12) for every 
map (j) G V{Tiq). Moreover, the corresponding embedding 

e = : l^(So) ^ y(E) : ^ 

is actually an isometry from ^(Eo) into V{T) that maps ^(Eo) onto a 
block ^ 

5(Eo) = Bs(So) := {0 : G V(Eo)} 
of B(ll). Furthermore, we have 

(6) i?(Eo) = {0GV(E):V5es-So0(5) = ^(^\So)} 

= {0 G y(E) : E^"^) n Eo 7^ 0} 

and, hence, also 

(7) i?(Eo) = {0 G \/(E) : A(0, 0o) C Eq)} 

/or any map 0o G -B(Eo). And there exist a {necessarily unique) "gate" 
(j)Y.a G -B(Eo) for any map G V{T^), i.e., a map in -B(Eo) with 
D{^,il)) — D(0, 0So) + D(0so,'0) /o?^ '0 £ -S(Eo), which is given 
by 

° 1 A(S'\Eo) otherwise 

and must necessarily be contained in cut{Yl) in case ^ 5(Eo) holds. 
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In particular, given any two distinct connected components So, Si G 
7ro(S) of the incompatibility graph ofTi, the map defined by 

I A(S'\So) otherwise, 

is the unique gate 4>y,o ^ -B(So) of all maps 4> G And any map 

in the intersection -B(Eo) fl i?(Ei) of the blocks B{T^q) and B{T^i), if 
there is any such map, must he a cut vertex of B{T). Furthermore, the 
following assertions all are equivalent: 

{B n B : 1) There exists some (p G cut(T?) with (j) G B^Eq) n i?(Si). 

{BnB -.2) 5(So) n B{T.i)is non-empty. 

{B (1 B : 3) -B(So) fl -B(Si) is a one-vertex set. 

{BnB: 4) A{S \ Eq) = A(5 \ Ei) holds /or a// 5 G E - (Eq U Ei) . /. e., 
there exists no split S = {A, B} G S-(EoUEi) with AnAo = 
for some set Aq in some split Sq G Eq and B HBi — for some 

set Bi in some split 5*1 G Ei . 

And in case all of these assertions hold, the map 0So|Si ^ -B(Eo) coin- 
cides with the correspondingly defined map 0Si|So7 ^'^^ it is the unique 
map in S(Eo) fl 5(Ei). 

Finally, mapping each connected component Eq of the incompatibility 
graph of 12 onto the associated block B(J2q) induces a canonical bisection 

^ = *s : 7ro(E) ^ Si(E) : Eq ^ 5(Eo) 

from the set 7ro(E) of connected components o/r(E) onto the set Bl^E) 
of all blocks ofB{Y2). 

Proof: To simplify the exposition of the proof, we will present it as a 
series of 12 observations: (i) It follows immediately from the definitions 
and Assertion (4.6) that -B(Eo) is a subset of V^(E). It is also obvious 
that © is an isometry as, by definition, the even stronger assertion 

A(0, V^) = A(0, V') 

apparently holds for all 0, £ ^(^o)- Hence, the subgraph induced by 
B(E) on -B(Eo) is an isometric subgraph of S(E). 

(ii) We clearly have 

S(Eo) = {0 G V{T) : V5es-Eo0(^) = A{S\11^)} 

as 4){S) = A{S \ Eo) holds, by definition, for all G -B(Eo) and all 
S" G E — Eq. Conversely, given any map G V^(E) with 0(5') = A{S\i 
Eq) for all S* G E — Eq, we have ip := G V(Eo) and ip = (j). 

(in) Further, if 0(5') = A{S' \ Eq) holds for aU 5' G E - Eq, 
0(5') n A 7^ must hold for all 5' G E — Eq and subsets A in any split 
5 G Eq. Thus, ii (f) = ijj holds for some map ij G 1/(Eo), 5 G E(<^) must 
hold for any 5 G Eq for which 0(5) = '0(5) is a minimal subset in 
^/'[Eo] = {US') : S' G Eo}. So, we have indeed S(Eo) C {0 g ^(E) : 
E(^) n Eo 7^ 0}, as required. 
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(iv) Conversely, given any G ViT,) with T,^^'' fl Sq 7^ 0, Asser- 
tion (4.5) implies that one must have 0(5") = A{S'\S) for any split 
S G S*^'^'' and any split 5" G S that is compatible with S. Hence, 
4>{S') = yl(5"\So) must hold for any split 5" G S — Sq. Thus, putting 

•= 0|eo) we have ip ^ ^(^o) as well as (p = ip. So, G -B(So) holds. 
Thus, also {0 G 1/(E) : E^*) n Sq 7^ 0} C 5(Eo) and, therefore, also 
{0 G 1/(S) : S^"^) n So ^ 0} = 5(Eo) must hold, as claimed. 

(v) Next, choosing any fixed map 0o G i?(So), we have also -B(So) = 
{0GnS): A(0,0o)CSo)}as 

A(0,0o) C So ^ 0(5) = 0o(5) = A(5\So) 

holds for all 0o G 5(So) and G F(S). 

(vi) It is also obvious that the map 0sg as defined above is indeed 
contained in -B(So) and that, given any map G V(T,), 0Sq is indeed 
the (necessarily unique) map in -B(Eo) with D{(f),ip) = Z}(0, 0So) + 
-0(0205^^) all -0 G -B(So) and, hence, the "gate" of in B(T,q). 
And 0Eq G cut(T,) must hold in case ^ -B(So) as it 'separates' from 
i?(So). I.e., there must be edges incident with 02^ whose types must be 
contained in distinct connected components of r(S), those leading to 
and those leading to any map ip G -B(Eo) distinct from 02^ which must 
exist as the cardinality of -B(So) must be at least 2 for any connected 
component So of r(S). 

(vii) Next, given any two distinct connected components So, Si G 
7ro(S), let ip denote the map ip := 0So|Si : E — )■ V{X) that maps any 
split 5 G So onto y4(5\Si) and any other split 5 G S onto y4(5\So). 
It is obvious that the restriction -^Isq : S t-)- y4(5\Si) of ip to So is 
contained in V{T,q) in view of Assertion (4.6). And it is also obvious 

that ip coincides with the extension ip\j:^ of this restriction V'lso- 
is contained in i?(So). 

It is also the gate 0So in -B(So) of any map G -B(Si) as, by def- 
inition, also 0So(5) = 0(5') = y4(5\Si) = holds for all 5 G So and 
0^.^(5) = A(5\So) holds for all ^ ^ So. 

(viii) Furthermore, any map G -B(So) ni?(Si) must be a cut vertex 
of -B(S): Indeed, ([7]) implies that there must be edges incident with any 
such whose types are contained in So as well as edges whose types 
are contained in Si. Thus, the assertions {B (1 B : 1) and [B Ci B : 2) 
are indeed equivalent and follow from {B H B : 3). Further, if G 
5(So)n5(Si) holds, we must have 0(^) = A{S\Eo) for all S G S-So 
and 0(5") = A(5'\Si) for all 5" G S— Si. So, there can be only one such 
map, and A(5'\So) = A(5'\Si) must hold for all 5" G S — (So U Si) 
in case such a map exists. So, "(5 n B : 2) {B n B : 3)" and 



"(5 nB -.2) ^ {BnB : 4)" holds. 

(ix) And if (fi n : 4) holds, i.e., if A{S\Eo) = A(^\Si) holds 
for all 5* G S — (So U Si), there exists, of course, no split 5' = {A, B} G 
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S — (So U Si) with An Aq = ioT some set Aq in some split S'o G Sq 
and B n Bi = for some set Bi in some split Si G Si: Indeed, this 
would imply A(5\So) = B A(5\Si) = A. 

Furthermore, the map ip = 0So|Si coincides with the correspondingly 
defined map 0Si|So- Indeed, Assertion {B n B : 4) implies that both of 
these maps coincide: Both coincide with A(S'\So) = A(S'\Si) on 
all splits S" in S — (Sq U Si) while they both coincide with yl(S'\Si) 
if G So and with y4(S'\S9) if 5 G Si. So, must the unique gate 
in -B(So) of every map in -B(Si) and, simultaneously, the unique gate 
in -B(Si) of every map in B(T,2)- Thus, -B(Si) fl B(T,2) 7^ must hold. 
I.e., {B n B : 4) implies also {B H B : 2). So, all of the assertions 
{B n B : i) i = 1,2,3,4 must indeed be equivalent, as claimed. 

(x) Furthermore, as -B(So) is 2-connected (cf. Corollary 14. 5 1) , it follows 
from the definition of a block that -B(So) is a subset of some block 
B G BICE). However, -B(So) must, in fact, coincide with that block 
B: Indeed, 0o ^ B(T,q) and (f)i E B imphes that, by the definition of a 
block, r^(V^(<^),0o) = r^(\/W,0i) must hold for all G y(S)-{0o, 0i}- 
Thus, by Corollary 14. 5[ there must exist some connected component Sq 
of the incompatibility graph r(S) of S with A(0o; 0i) ^ ^o- Moreover, 
by replacing 0o by any map 0q G -B(So) with 7^ A(0o, <Po) if necessary, 
we may also assume that A{(j)o, (f)i) n Sq 7^ must hold and, therefore, 
So = Sq 3 A(0o5 0i)- This, in turn, implies that also 0i G i?(So) and, 
therefore, also B = -B(So) must hold. 

(xi) We clearly have -B(So) 7^ -B(Sq) for any two distinct connected 
components Sq, Sq in 7ro(S), (as, e.g., A(0, ■?/') C So and A{(f)',ip') C Sq 
must hold for any (f>,ip E -B(So) and 4>',ip' G ^(Sq)). Thus, the map ^ 
is injective. 

(xii) And finally, \E' is also surjective as there exists, for any block 
B G BI(Tj), some connected component Sq G 7ro(S) of r(S) with 
-B(So) = B. Indeed, choose any edge e := {0i,02} in B, let 5* be the 
unique split in A(0i, ^2), and let So = r(S, S) be the connected com- 
ponent of r(S) that contains the split G S. Then, 5" G S^'^i) n S(<^2) 
implies, in view of Assertion (4.5), that 0i, 02 G -B(So) must hold. How- 
ever, according to well-known properties of blocks, this can be true only 
if i? = -B(So) holds. This completes the proof of the theorem. 1 

6. The X-tree associated with S 

Let us recall first that, given any connected simple graph V = {y,E) 
with vertex set V and edge set E C the graph T(r) with vertex 
set V(r) the disjoint union -B/(r) H V" of the set -B/(r) of all blocks of F 
and the set V of all of its vertices, and edge set £{T) the set of all pairs 
{B,v} with B G -B^(F) and f G -B is well-known to always be a tree 
(cf. P Proposition 3.1.2]). Clearly, the degree deg7-(r)('^) of vertex 
V E V, considered as a vertex in T(F), coincides with the number of 
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blocks that contain it, and the degree deg7-(p)(-B) of any block B of F, 
considered as a vertex in T(r), coincides with the number of vertices 
it contains. 

In consequence, continuing with the notation introduced so far, every 
collection S of X-splits gives rise to a tree T(S) := T(-B(S)) with 
vertex set V(S) := 11 V{Ti), the disjoint union of Bl(ll) and 

V{T), and edge set the set £{J^) of all pairs {B, 0} with B G Bl{T) and 
(f) & B. Moreover, the canonical labelling map (p^, : X ^ ^(S) '■ x ^ (p^ 
from X into the ^(S) can also be viewed as a labelling map from X 
into the vertex set V(S) of T(S). 

Note further that the degree deg7-(2)(-B) of any block B of 
considered as a vertex in T(S), coincides with the number of vertices 
it contains. In addition, the degree deg7-(2)(0) of any vertex G V(T,) 
of -B(S), considered as a vertex in T(S), coincides with the number 
of blocks that contain it and, hence, with the cardinality of 71^(2*^'^^), 
TT^{T.), 7r^{X^'^^) as well as of ir^j^CV^'^^). In particular, a vertex G V{T.) 
is of degree larger than 1 in T(S) if and only if it is a cut vertex of 

In consequence, it is fairly obvious that we can also associate, to 
any system S of X-sphts, a "reduced" tree which is a "proper" 
X-tree: All one needs to do is (i) to delete all non-labeled vertices 
(j) G V(S) with (f) G V(T,) — cMt(E) and the pendant edges leading 
to them and (ii) to suppress all vertices of degree 2, i.e., to replace 
each maximal sequence Uo,Ui, ... ,Uk of distinct non-labeled vertices of 
T(S) with {ui^i,Ui} G S(T,) for all 2 = l,...,k and degrp^{ui) = 2 
for alH = 1, . . . , A; — 1 by just one edge {uq, Uk} while simultaneously 
deleting all the vertices ui,U2, ■ ■ ■ , Uk-i in between uo and Uk and the 
edges incident with them, see also Figure [2l 

Thus, a block B in of the form B = i?(So) for some Sq G 

7ro(S) will be suppressed if and only if So consists of a single split, 
only. Otherwise, its degree degj.^(i?) in the "reduced" tree coincides 
with the number of equivalence classes of the equivalence relation ~So 
defined on X by 

X ~Eo y V5gSo'^(a;) = S{y). 

In particular, the labelling map 0s sets up a bijection from X onto the 
set of leaves of T(S) if and only if cut(J2) fl {(px ■ x G X} = and the 
equivalence relation ~e is the identity relation on X, i.e., flsgs ^i^) ~ 
{x} holds for all x G X. 

Altogether, this implies 

Theorem 6.1. Suppose as above that X is a finite set of cardinality 
at least 2, and that is a collection of X -splits. Then, the following 
holds: 

(i) The tree T{Tj) = T(i3(S)) canonically associated with the graph 
i3(E) is isomorphic to the graph with vertex set the disjoint union of 
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(>) (b) 

Figure 2. (a) The graph T(S8) corresponding to the 
Buneman graph B{T,s) in Figure [H where white ver- 
tices correspond to elements in the set ^/(Sg) and black 
vertices to those in V(T,s)- (b) The X-tree Tsg ob- 
tained from the tree T(S8) by deleting all non-labeled 
vertices (j) e V(E) with G V{T,) — cut(J2) and the 
pendant edges leading to them and suppressing all ver- 
tices of degree 2 in T(T,s). Note that the split system 
{Si, S2, S3, Si23, S4, S5, Sq7s, Se, Sj, Ss} gives rise to the 
same X-tree. 

7ro(S) and V(T,), and edge set 

{{So, 0} : So G 7ro(S), G V(S), S^ n So 7^ 0}. 

(ii) The graph obtained from the tree T(S) by deleting all non-labeled 
vertices G V"(S) — cMt(S) and the pendant edges leading to them 
as well as "suppressing" all non-labeled vertices of degree 2 in T(S) 
is, together with the induced labelling map from X into its vertex set, 
an X-tree that we denote by T^. It is canonically associated with S 
and coincides with that X-tree that, according to Peter Buneman, is 
associated with S in case S is compatible. 

(iii) Furthermore, given any distinct blocks Bi, B2 G i?/(S) for which 
a cut vertex G -Bi fl i?2 exists, one has 

max(deg^(s)(^i),deg^(2)(S2),deg^(s)(0)) > 3, 

that is, at least one of the three vertices Bi,B2, and ofTiT?) must, 
for every such triple Bi,B2, and 0, also be a vertex of the X-tree 
derived from T(S). 

Proof: The first two assertions follow immediately from our previous 
observations. To establish (iii), note that since Bi 7^ B2 we have 
degr(s)(5i),deg7-(s)(S2) > 2. But deg7-(s)(5i), deg7-(s)(52) = 2 would 
imply that there exist two distinct vertices 1^1,1^2 ^ ^(S) ~ {0} with 
Bi = {ipi,(f)} and B2 = {-025 0}- Let 6*1, 6*2 G S denote the splits 
for which A('0i,0) = {5*1} and A('02,0) = {5*2} both hold. Then 
AiU A2 = X must hold for Ai := 0(S'i) and A2 := 0(5*2). This, in 
turn, implies that fl A2 7^ must hold as well as Si, S2 ^ A(0, 0a,) 
for every x E Ai (1 A2. Hence, in view of Lemma [4.21 and Theorem 15. 
r^(S'^'^)) contains at least three connected components: {^i}, {5*2}, and 
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the connected component containing Amin(0|0a;) for some x G Air\A2. I 

As mentioned already above, we will explore these matters in more 
detail in [15] . We will consider in particular the graph theoretical invari- 
ant, defined according to [13], of the Buneman complex'QiTj) associated 
to S as defined in [12]. And we will show that as well as B(S) can 
be described as a push-out in terms of the data involved in the con- 
struction of Ts suggesting efficient algorithms for their computation 
(see also [13 [IS]). 

In yet another paper, we will discuss what can be done in case 
the incompatibility graph r(S) is connected and -S(S) is, hence, 2- 
connected. In particular, by directing attention towards cut faces rather 
than merely cut vertices of we show that it may be possible even 

in this case to extract valuable phylogenetic information from a split 
system E. 
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